APPROXIMATION BY ANALYTIC MATRIX FUNCTIONS. 
THE FOUR BLOCK PROBLEM 
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Abstract. We study the problem of finding a superoptimal solution to the four 

$11 $12 
$21 

circle the four block problem is to minimize the norm of 



block problem. Given a bounded block matrix function { ^ on the unit 

1 $21 *22 ' 



$11 -F $12 
$21 $22 

over F G H°°. Such a minimizing F (an optimal solution) is almost never unique. 
We consider the problem to find a superoptimal solution which minimizes not only 
the supremum of the matrix norms but also the suprema of all further singular val- 
ues. We give a natural condition under which the superoptimal solution is unique. 



1. Introduction 

The problem of approximating a given scalar function (f on the unit circle T uni- 
formly by functions analytic in the unit disk D has been attracting analysts for a long 
time (see [Kha], [RSh], [Ne], [AAKl-2], [CJ], [PKh]). It was shown in [Kha] that for 
a continuous function ip such a best approximation is unique while it is not unique in 
the general case. Later it turned out that this problem is closely related with Hankel 
operators. Namely, it was proved by Nehari [Ne] that 



where if^ : — > if ^ is the Hankel operator with symbol (p defined by 



(we denote by P_|_ and P_ the orthogonal projections onto and H^). Presently 
the problem of approximating by analytic functions in L°° is called Nehari's problem. 

We shall also need the notion of a Toeplitz operator. Given ip G L°° the Toeplitz 
operator : if ^ is defined by 

= P+^/, / e ffl 
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Adamyan, Arov and Krein [AAKl-2] found many interesting connections between 
Hankel operators and Nehari's problem. In particular they found a more general 
condition under which a best approximation is unique: if the essential norm ||^/^(p||g is 
less than ||i?<^||, then </? has a unique best approximation. (Recall that for an operator 
T on Hilbert space 

||r||e = inf{||T -K\\: K is compact}). 

This sufficient uniqueness condition can easily be reformulated in terms of the func- 
tion (/? itself since 

||//^||e = distioo((^, H°° + C) 

(see [AAKl-2]). They also found a criterion of uniqueness of a best approximation 
in terms of the corresponding Hankel operator, and in the case of non-uniqueness 
parametrized all best approximations (optimal solutions of Nehari's problem), see 
[AAKl-2]. However it is not very easy to verify whether a function (p in L°° satisfies 
the criterion. 

Carleson and Jacobs [CJ] studied smoothness properties of the best approximation 
for smooth functions ip. They proved that if ip belongs to the Holder-Zygmund class 
Aa, a > 0, a ^ Z, then the best approximation also belongs to the same class. 

Later in [PKh] more general hereditary properties of the non-linear operator of 
best approximation were studied. For a large class of function spaces X on T it was 
proved that if (/? G X and / is the best approximation by analytic functions, then 
/ e X. Note also that in [PKh] Nehari's problem was also applied in prediction 
theory which led to a new approach to the problem of describing stationary processes 
satisfying various regularity conditions in terms of their spectral densities. 

A new wave of interest in Nehari's problem was caused by the development of 
H°° control theory where Nehari's problem plays a central role (see [Fr]). Moreover 
for the needs of H°° control theory it is important to consider Nehari's problem for 
matrix- valued functions: given an n x m matrix function $ on T the problem is to 
approximate $ by bounded analytic matrix functions Q in the norm 

||$-Q|U'Messsup||$(C)-F(C)||, 

where || ■ || on the right-hand side is the norm of the matrix as an operator from 
C""^ to C". However in contrast with the scalar case we have uniqueness of a best 
approximation only in exceptional cases. Indeed, let 




Clearly distLoo{z, H°°) = 1 and so ||<I> — Q\\oo > 1 for any Q E H'^. However it is 


q 



easy to see that any function of the form ( ^ !! ) with q e H°°, \\q\\oo < a best 



APPROXIMATION BY ANALYTIC MATRIX FUNCTIONS. THE FOUR BLOCK PROBLEM 3 



approximation. Intuitively, however, is clear that the "very best" approximation is 
the zero matrix function O. 

In [Y] Young suggested imposing the following additional assumptions on approx- 
imating functions. Let be the set of best approximations: 

= {g e : Q minimizes ess sup ||$(C) - Q(C)I|}- 

Define inductively the sets flj as follows 

Qj — {Q e Oj-i : Q minimizes esssupSj($(C) — Q{C))} 

CeT 

(for a matrix (or an operator) A the jth singular value Sj{A), j > 0, is the distance 

from A to the set of matrices (operators) of rank at most j, So{A) =^ ||^||)- Elements 
of ^mm{m,n}-i ^rc Called superoptimal approximations of $ (or superoptimal solutions 
of Nehari's problem). Put 

tj = esssups^($(C) - (3(C)), Q e 

CeT 

The numbers tj, < j < min{m, n} — 1, are called the superoptimal singular values. 
As in the scalar case, to = \\H^\\ (the Hankel operator : i7^(C") — >• i7^(C'") is 
defined in the same way as in the scalar case). 

Note that Q is a superoptimal solution of Nehari's problem if and only if it lexico- 
graphically minimizes the sequence {s?°($ — Q)}j>o, where for a matrix function F 
on T 

8f(F) tfesssups,(F(C)). 

CeT 

It was proved in [PYl] that for $ G H°° + C there exists a unique superoptimal 
approximation Q. The method of the proof in [PYl] is based on certain special 
factorizations of matrix functions (thematic factorizations) and it is constructive. 
Later in [T] another method was suggested to establish uniqueness in the H°° + C 
case which is based on weighted Nehari's problem. 

However in the case when the Hankel operator is non-compact, there was no 
analog of the Adamyan-Arov-Krein sufficient condition for uniqueness in the case of 
matrix functions. 

Nehari's problem is a special case of the so-called four block problem which is one 
of the most important problems in control theory. Let $ be a block matrix function 
of the form 



$11 


$12 


$21 


$22 ^ 
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Here $ has size mxn, $11 has size mixrii, and $2 has size m2 x 712. The four block 
problem is to minimize 



(i-i: 



$11 -Q 


$12 


$21 


$22 



over bounded analytic functions Q of size mi x rii. A function Q G iJ°°(Mmini) is 
called an optimal solution of the four block problem if it minimizes the norm (1.1). 

The four block problem arises naturally when one considers the following (model- 
matching) problem in H°° control. Let F, Gi and G2 be matrix functions of class 
H°°. The problem is to minimize 



\F-GiQG2\ 



1.2) 



over Q G H°° (the sizes of the matrix functions in (|1.2| ) are such that (1.2) is mean- 
ingful). Many problems in control reduce to the model-matching problem. 

Engineers usually consider the case of continuous (on T) , or even rational functions 
Gi and G2 and assume that these functions have constant rank on the boundary. Un- 
der this assumption the model-matching problem reduces to the four block problem 
( |1 . 1| ) , while it reduces to Nehari's problem only if the matrices G2, G\ have maximal 



column rank (the rank equals the number of columns). The assumption on the max- 
imal column rank does not hold for many interesting applied problems, so engineers 
have to consider the four block problem as well. 

In the most general case the model matching problem ( |1.2D reduces to the four 
block problem under the assumption that the outer parts of functions Gi and are 
right invertible in L°° (for continuous functions this is equivalent to fact that they 
have constant rank on T). The model mathcing problem reduces to Nehari's problem 
if the outer parts of functions Gi and G2 are invertible in (which for continuous 
functions is equivalent to the above maximal column rank assumption). 

By analogy with Hankel operators we define the four block operator r$ : if^(C"'^) © 
LS^C"^) ^ Hl{C^^) ® L\C""') by 





where F' is the orthogonal projection from L'^{C"^^) © L'^{C"'^) onto if2(C'"i) © 
L^(C'^^). As in the case of Hankel operators the infimum in ( |1 . 1|) is equal to ||r$|| 
(see [FT]). The matrix function $ is called a symbol of the four block operator r$ (a 
four block operator has many different symbols). 

As in the case of Nehari's problem we can define the sets Qj: 



Q G : Q minimizes 



$11 -F 


$12 


$21 


$22 
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flj = <Q E Qj-i : Q minimizes ess sup Sj 

[ C6T 





$12 


$21 


$22 



(C) 



Q is called a superoptimal solution of the four block problem if Q G f^min{mi,ni}-i- 
We define the superoptimal singular values of the four block problem ( |1 . 1| ) by 



tj = ess sup Sj 



$11 -g 


$12 


$21 


$22 



iO], Fen 



r 



Clearly, to = ||r$||. 

We say that a $ is a superoptimal symbol of a four block operator F if F = r$ and 
the zero function is a superoptimal solution of the corresponding four block problem. 

Using a simple compactness argument one can prove easily that a superoptimal 
solution always exists. However we cannot expect a sufficient condition for uniqueness 
of the superoptimal approximation which would be similar to the one found in [PYl] 
in the case of Nehari's problem. Indeed it can easily be proved that a four block 
operator cannot be compact unless $12, $21, and $22 are identically equal to zero in 
which case the four block problem is equivalent to Nehari's problem. 

The main result of the paper (Theorem |2.1|) is a sufficient condition for the four 
block problem to have a unique superoptimal solution. This result can be considered 
as an analog of the Adamyan-Arov-Krein theorem mentioned above which deals 
with Nehari's problem in the scalar case. Note that Theorem |2?l| also gives us a new 
result for Nehari's problem in the case when the corresponding Hankel operator is 
non-compact. 

The proof is constructive. We give an algorithm to find the unique superoptimal 
solution. The algorithm is similar to the one given in [PYl], it reduces the problem 
to the case of matrix functions of lower size. However the proof is considerably more 
complicated than in the case of Nehari's problem with compact Hankel operator. 

In Section 3 we describe briefly the method of factorization and diagonalization. We 
construct important matrix functions V and W which can be considered as analogs 
of the thematic functions deflned in [PYl]. 

In Section 4 we use the construction of Section 3 to parametrize all optimal so- 
lutions of the four block problem. This allows us to reduce the problem of flnding 
superoptimal solutions to the case of matrix functions of lower size. 

Section 5 is devoted to the proof of the fact that the matrix functions Vc and Wc 
which are submatrices of V and W are left invertible in H°°. This is one of the 
principal points in the proof of the main result. 

In Section 6 we prove another crucial fact for the proof of the main result. Namely, 
we show that if $ satisfles the hypotheses of Theorem |2.1| , then the lower order four 
block problem, obtained as a result of parametrization in Section 4, also satisfles 



the hypotheses of Theorem 2.1. This makes it possible to continue the process and 



complete the proof of the main result. 
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In Section 7 we study superoptimal symbols of four block operators. We obtain 
certain special factorizations of such symbols (thematic factorizations), and define 
the indices of such factorizations. In the case of Nehari's problem with compact 
Hankel operators such factorizations were found in [PYl]. 

To prove the invariance of indices we introduce in Section 8 the notion of a super- 
optimal weight for the four block operator. This is an analog of the notion introduced 
in [T] in the case of compact Hankel operators. 

In Section [9] we use superoptimal weights to prove that the sums of the indices in 
a thematic factorization which correspond to equal superoptimal singular values do 
not depend on the choice of factorization. In the case of compact Hankel operators 
this invariance property was proved in [PY2]. Note that as in the case of Nehari's 
problem for H°° + C functions if there are equal superoptimal singular values, the 
indices can depend on the choice of thematic factorization (see [PY2]). 

The last section is devoted to inequalities between the superoptimal singular values 
and the singular values of the four block operator. We obtain an inequality which 
is new even in the case of Nehari's problem with compact Hankel operator. It is 
stronger than the one obtained in [PY2]. 

Note that in [PYl] hereditary properties of the non-linear operator of superoptimal 
approximation were studied. It was shown there that for a large class of function 
spaces X the inclusion $ G X implies that the superoptimal approximant to $ also 
belongs io X. It would be interesting to find analogs of such results in the case of the 
four block problem. In particular we do not know whether the superoptimal solution 
of the four block problem ( |1.1| ) must belong to a Holder class if $ G A^. 

Throughout this paper we shall denote by M^.n the space oi m x n matrices. 
We shall use the notation L°°{Mm,n) and H°^{Mm,n) for the spaces of bounded and 
bounded analytic functions which take values in Mm,n- Sometimes if it does not 
lead to a confusion, we shall simply write and H°° instead of L°°{Mmn) and 
i^~(M„,„)- 

A matrix function G H°° is called inner if 0(C) is isometric for almost all C ^ T. 
A matrix function F G H°°{Mm,n) is called outer ii FH'^{€^) is dense in H'^{€^). A 
function F G H°°{Mm,n) is called co-outer ii the transposed function F* G H°°{Mn,m) 
is outer. 



2. The main result 
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In this section we state the main result of the paper as well as important corollaries. 
Let $ be a matrix function of the form 



$11 


$12 


$21 


$22 



(2.1) 



where $ has size m x n, $ii has size mi x rii, and $22 has size 772,2 x '"-2- Recall 
that r$ is the four block operator defined in Section 1 and {tj} is the sequence of 
superoptimal singular values. 

The following theorem is the main result of the paper. 

Theorem 2.1.. Let ^ be a bounded function of the form ( |2.1| ). Suppose that ||r$||e 
is less than the smallest nonzero superoptimal singular value. Then there exists a 
unique superoptimal solution Q of the four block problem for $. The singular values 



$11 -g 


$12 


$21 


$22 , 



(0 , l<J<d-l, 



are constant on T. 



The following partial case of Theorem improves the result of [PYl] on unique- 
ness of superoptimal solutions of Nehari's problem. 

Theorem 2.2.. Let $ be bounded matrix function on T. Suppose that ||-f^$||c is 
less that the smallest nonzero superoptimal singular value of Nehari's problem. Then 
$ has a unique superoptimal approximation Q by bounded analytic matrix functions. 
The singular values Sj($(C) — <5(C)) o'^c constant on T. 



3. DiAGONALIZATION 



In this section we start with a maximizing vector of the four block operator and 
we construct a certain special unitary-valued matrix. This allows us to achieve a 
diagonalization. Later using this diagonalization we shall reduce the problem to the 
case of a matrix function of a lower size. 

Lemma 3.1.. Let v be annxl inner matrix function. Then there exists a co-outer 
function G if°°(M„^„_i) such that the matrix function 

is unitary-valued on T. 
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In [PYl] a stronger result was obtained. It was shown that all minors of V on the 
first column are in H°° . This property of analyticity of minors was essential for the 
proof of the uniqeness of a superoptimal solution of Nehari's problem which was given 
in [PYl] . Earlier the existence of a co-outer Vc satisfying the requirement of Lemma 
P . 1| was proved in [Va], however the property of analyticity of minors was not noticed 
in [Va]. It also can be shown that if we Vj^-* and V^*-^-* are n x (n — 1) co-outer functions 
satisfying the requirements of Lemma then there exists a constant unitary matrix 
U such that l/Ji) = V^'^'^U (see [Va], [PYl]). 

To start the procedure we need a maximizing vector of the four block operator r$, 
i.e., a nonzero vector / e H^{C^) © ^^(C"^) such that ||r$/|| = ||r$|| ■ ||/||. If $ 



satisfies the hypotheses of Theorem ^71] and r$ ^ O, then ||r$|| = to > IIP* lie and so 
a maximizing vector for P$ exists. 

The following fact is well-known in the case of Hankel operators (see [AAK3]). The 
proof of it in the case of four block operators is similar. 



Lemma 3.2.. Let ^ be a matrix function on T of the form ( p.l| ) and such that 
||$||oo = ||P$||- Suppose that f is a maximizing vector for P$ and put g = tg^P^/. 
Then P$/ = $/ and ||^(C)||c- = ||/(C)l|c" o-e. on T. Furthermore ||$(C)|| = II^IU 
a.e. on T. 

Proof. We have 

||P$/||2 = ||P-$/||2 < ||$/||2 < ||$||oo||/||2 = ||P*|| ■ II/II2 = ||P*/||2. 

It follows that all inequalities in this chain are, in fact, equalities. The fact that 
||p-$||2 = ||$||2 certainly means that $/ G Hl{C^^) © L'^{C^^) and so P$/ = 
The equality ||<I>/||2 = ||$||oo||/||2 implies that ||5'(C)l|c- = ||/(C)||c" a.e. on T, which 
in turn implies that ||$(C)|| = ||$||oo for almost all C G T. ■ 

Lemma 3.3.. Let ^ be a matrix function of the form ( ^.Ij ) such that ||P<i.||c < 
||P$|| . Suppose that f = /i©/2 is a maximizing vector forTq> andt^g = gi®g2 = P$/; 
where /i G H^C"^), /s G L^C"'), gi G HliV^^), and ^2 G ^^(C™^). Then 



ll/i(C)II^.H > ^M^J^II/(C)II^™, 11^71(011^"^. > "^^"iip iil^^"' lk(C)l 

J- <l> J- ll 



almost everywhere on T. 

Proof. By subtracting an optimal solution, we can assume that ||$||oo = IIP^II- By 
Lemma PI, t^g{C) = $(C)/(C) and MQWc^ = ||/(C)l|c" a.e. on T. Therefore /(C) 
is a maximizing vector for $(C) and g{C,) is a maximizing vector for $*(C)- We have 
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92 = [ $21 $22 ) /• It is well known that 



21 



22 



r 



21 



$21 $22 



and so 
Hence 



( $21 $22 



'22 



$22 

— Iir^lle- 

||/(C)llc"<l|r*||e||/(C)llc". 



||r$|| ■ \\g2i()\\c^2 < IK $21 $ 

On the other hand 

ll^?(C)ll^™ = lkl(C)ll^"H + ll^?2(C)ll^"^. = 11/(011^™. 

Therefore 



Ifi'i llc" 



ir^ip 



To prove the inequality for /i(C) we can use the same argument since to/(C) 
*-(C)9(C) and 



( 



$^2 $22 



$12 
$22 



o 


$12 ^ 


o 


$22 



o 


$12 ^ 


o 


$22 



Corollary 3.4.. Let ^ be a matrix function of the form ( p^.lD such that ||r$||e < 
||r$||. Suppose that min{mi,ni} = 1. Then there exists a unique optimal solution of 
the four block problem. 

Proof. It is sufficient to consider the case = 1. To obtain the result in the case 
nil = 1 we can pass to the transpose of $. Let / = /i © /2 be a maximizing vector 
of r$ and let Q G if°°(Mmii) be an optimal solution of the four block problem, i.e., 

$11 - Q $12 
$21 $22 ^ 

It follows from Lemma |3.3| that /i is a nonzero scalar function in H^. By Lemma |3]2 

/ $11 - Q $12 \ / /l \ _ p r 
[ $21 $22^/2^" 

Therefore Qfi is uniquely determined by r$ and / (Q is a column matrix) and since 
fi 7^ O, it follows that Q is uniquely determined by $. ■ 

Now, following ideas of [PYl] we construct diagonalizing matrix functions V and 
W in the following way. 
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Take a maximizing vector f = f\ ® f^, h e /a G ^^(C"^). Let h he a 

scalar outer function such that |/i(C)P = ||/(C)IIc'm C ^ T. Such a function h always 
exists since /i G and /i 7^ O. Denote by -i? a greatest common inner divisor 
of the entries of /i (it may happen that = 1). Define the vector f = f 1 © f 2 € 
if°°(C"i) © L°^(C"2) hj V = df /h. Clearly, ||i;(z)|| = 1 a.e. on T. The vector v will 
be the first column of the matrix V. 

We can represent the column function vi as Vi = ^(o)^(j) where f(o) is a scalar 
outer function such that |'y(o)(C)l = ll'^i(C)l|c"i a.e. on T and is an inner column 
function (the inner part of fi), i.e., ||^^(i)(C) Ilc"i = 1 a-e. on T. Applying Lemma |3 . 1| to 




V = we obtain an inner and co-outer matrix Vc such that the matrix V 
is unitary-valued. Note that the vector-function f(j) is pointwise orthogonal to any 
column of Vc a.e. on T, and so the same is true for vi. So the matrix function 

(3.1) 

is isometric almost everywhere on T. 

It is easy to see that we can complete this matrix function by adding measur- 
able column functions to obtain a unitary-valued function. Indeed it is sufficient to 
complete the matrix function to a square matrix function whose columns are point- 
wise linear independent and then apply the Gram-Schmidt orthogonalization process 
to the columns. To this end we can approximate our matrix function uniformly by 
step functions which take isometric values. Clearly, we can find a unitary completion 
for each step function. It is easy to see that if the distance from a step function to 
our initial function is sufficiently small, then the columns of our initial function and 
the columns we added to the step function are linearly independent. 

Let ^ be a unitary- valued completion of the matrix ( p.l|) . Then V has the form 

V -- 







V2 





Let us now construct a unitary-valued matrix in a similar way. Let tog = r$/. 



Then by Lemma pl2| , ||5'(C)|| = ||/(C)II = 1^(01; C ^ T. Let r be a greatest common 



inner divisor of all entries of zg^ (recall that g = gi(Bg2, Qi G i/^(C"*^), g2 G L^(C™^)). 
Define the column function w = wi © W2 G H°°{C^^) © L°°(C'"2) by w = zfg/h. By 



analogy with (|3.1| ) we can find a co-outer matrix function Wc such that the matrix 
function ( | takes isometric values on T. We can complete this matrix 

\ W2 O J 

function to a unitary-valued matrix function and define W to be its transpose: 

W' -- 



Wl Wc 




W2 
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To prove Theorem p.l| we shall proceed as follows. Let Qo be an optimal solution of 
the four block problem. We shall prove in the next section that the matrix function 

/ $11 - Qo $12 ^ 
V $21 $22 



admits a representation 



$11 - Qo 


$12 


$21 


$22 



w* 


( toUo 

O 


O 
^11 


o 

$S^2^ 




^ O 


^21 


$S 



y* 



where uq = zdrh/h and $i\^ is a matrix function of size (mi — Ij x 
shall also prove in Section 4 that if Q is another optimal solution, then 

\ 



rii 



We 



$11 -Q 


$12 ^ 


$21 


$22 



w* 


( toUo O 
O $il) - Qi 


O 

$S 




I o $^V 


$g 



where Qi G H^{M^^ -i,ni-i) and 



(1) 



11 

^21 



Qi $ 



(1) 

12 



^22 



Since V and W are unitary- valued, it is easy to see that Q is a superoptimal solution 
to the four block problem for the matrix function $ if and only if Qi is a superoptimal 
solution to the four block problem for the matrix function 



$(1) 



Moreover, if to.ti, - ■ ■ .td-i is sequence of superoptimal singular values of the four 
block problem for $, then ti, ■ ■ ■ , td-i is the sequence of superoptimal singular values 
of the four block problem for This reduction allows us to diminish the size of 
the matrix function $ii. 

If r^(i) = O, we clearly have uniqueness. To continue this process we have to be 
able to find a maximizing vector for the four block operator r${i) . We can certainly do 
that if its essential norm is still less than the smallest nonzero superoptimal singular 



value. In Section 6 we shall prove that ||r 



${1) I 



continue the process and reduce Theorem to Corollary 



which will allow us to 



4. PARAMETRIZATION of optimal SOLUTIONS 
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In this section we describe the optimal solutions of the four block problem in case 
when ||r$||e < ||r$||. 

Lemma 4.1.. Let ^ be a block matrix function of the form (|2.1| ) such that ||r$||e < 
||r$||, and let V and W be the matrix functions constructed in Section 3. Then there 
exists a unimodular function uq such that any optimal solution Qo of the four block 
problem satisfies 

\ 



$11 - Qo 


$12 ~ 


$21 


$22 



w* 


( toUo 

O 


O 
^11 


o 

$1^2' 




^ O 


^21 


$g 



y* 



(4.1) 



where is a matrix function of size mi — 1 x ni — 1. The unimodular function 

Uq admits a representation uq = zbh/h, where h is an outer function in and b 

is a finite Blaschke product. Moreover, the Toeplitz operator T^^ is Fredholm and 
Range T„o = i^2. 

Proof. Let / = /i 0/2 be a maximizing vector for r$ and let r<i,/ = t^g = to(5'i ©5'2)- 
Put uq = z-dfh/h (see the construction of the matrix functions V and W in Section 3). 

By Lemma [X^, /(C) is a maximizing vector for ^ '^^^''^'^ (C^"'"'''' $^^(C) ) 
everywhere on T and 



$11 -Qo $12 
$21 $22 



/ = tog. 



Therefore j(C) .s a ■r,axi,r,i.„rg vector for ( " 



C, eT and so 



$11 - Qo 
$21 



$12 
$22 



9 = tof- 



for almost all 



(4.2) 



Since f = dh 



V2 



dh 



and zg = rh 

' $11 - Qo 
$21 



^ W2 

$12 ' 
$22 . 



we have 

'V2 ) 



tozrh 



Wi 
W2 



It follows from the definition of the matrix functions V and W (see Section 3) that 

/ 1 \ / 1 \ 

^ $11 - Qo $12 ^ 



21 



V 



22 



tozfhW* 
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It is easy to see that the first column of W 
( touo O ■■■ (i 



$11 - Qo $12 
$21 $22 



V has the form 



first row of W 
that 



, where mq = z'drh/h. Similarly, using ( [4 .21 ) we find that the 
^ \V has the form ( toUQ O ■ ■ ■ O ), which proves 

has the form (|4.1|). 



$11 - 

V $21 

$11 - Qo $12 
\ $21 $22 ^ 

Let us show that the Toeplitz operator T„g is Fredholm and is onto. Clearly, 



"0 I 



1, since \\HuM2 = \\^-Z^f\\2 = \\z^f\\2 = 
Indeed, let / be a scalar function in H^. We have 

r^vf = p. 



2. We claim that L < 1- 



$11 - Qo $12 
$21 $22 



V2f 



( toUo 



( toUo 



\ 



$(1) $(1) 

^11 ^12 
^21 ^22 / 



F_W* 



$(1) 
^11 

$(1) 
^21 



Therefore 



WiUof 



W2Uof 



O \ 
$ff 
$g J 



V* 
( f \ 



vif 

V2f 



WiUof 
W2Uof 



\0 J 



Wi W2 ] \ _ 



toHuof^ 



whence 



tn\\H, 



which implies that \\Hu 



uo\\c < ||t^||2||w||2||r$||e 
< 1. 



ir 



<1> c 



< tn 



Since 1 1 if, 



M() ||C 



distLoo(Mo,ii°° + C) (see e.g., [S], [Ni]), it follows that \\K, 



Mo lie 



«0 I 



lim^^oo distioo(z-'Mo, ii°°). We have = \\h\\. So dist{uo, H°°) 

Therefore there exists a j G Z+ such that 

distioo(;z%o,^^°°) = 1 and dist loo {z^+^uq, H°°) < 1. 
This means that T^j+iun is left invertible and Tjj^ is not left invertible which implies 



that T^j+iug is invertible (see [Ni]). Clearly, T, 



T T 

-J- tin : 



Ty]^-\ is Fredholm and 



so is T„o. 

Since Mq has the form uq = zdfh/h, where d and r are inner and h is an outer 
function in H^, the Toeplitz operator has dense range (see [PKh]) which together 
with the Fredholmness of T^^ implies that T^^ is onto. 
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It remains to show that both and r are finite Blaschke products. Indeed, if k, is 
an inner divisor of i)T, it is easy to see that nh G KerT^o and since Tu^ is Fredholm, 
Ker is finite dimensional, which implies that both i) and r are finite Blaschke 
products. ■ 



Theorem 4.2.. Let ^ be a block matrix function of the form such that 

||r$||e < ||r$|| and let Qq be an optimal solution of the four block problem. Sup- 
pose that V, W, Uq, $n , $12^ $2i\ $22^ satisfy (|0| ) holds. Let Q be a matrix 
function of size nii x ni. Then Q is an optimal solution of the four block problem if 
and only if there exists Qi G if°°(Mmj_i^„^_i) that satisfies the following conditions: 



$11 -g 


$12 


$21 


$22 



w* 


( touo 




$S^2^ 




1 


$g 



V* 



(1) 



11 

^21 



Qi $ 



(1) 
12 



^22 



<to. 



(4.3) 



(4.4) 



To prove Theorem |4.2| we need the following result from [PYl]: 

Lemma 4.3.. Let V, W be L°° matrix functions on T, of types n x n, m x m 
respectively, which are unitary-valued a.e. and are of the form 

W* = 



V 



V, 



W Wr 



where v, V^, w, Wc are H°° matrix functions, v and w are column functions, and 
Vr, Wr are co-outer. Then 



Wi7°°(M^,0Vn 



L°°(M, 



m—l,n~l) 



l.n— 1 J 



Proof of Theorem |4.2| . Let Q be an optimal solution. By Lemma 

' Qo-Q 



w 



V 



has the form 



/ 



(the upper left block is scalar). On the other hand it is easy to see from the definition 
of V and W (see Section 3) that 



W 



Qo-Q 



( 



V 



Wi 



W,)\Qo-Q){vi V, 
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Therefore 



( )\Qo-Q){v, V, ) 



F 



for some F G L°°(Mm^_i.„^_i) (the upper left corner of the matrix function on the 
right hand side is scalar). Let Vi = f(o)^^(i), Wi = W(^o)W{i), where and w^o) are 
scalar outer functions, and f («) and are inner column functions. We have 



I 



( W^i) Wc ) (Qo - Q) ( ^« Vc 
wi )\Qo-Q){v, V, 



Via) O 

o / 



Via) O 
O / 



Put V = w = Clearly, the matrix functions V = v Vc j and W = 

w Wc ) satisfy the hypotheses of Lemma ^73| . Therefore F G if°°(Mmj_i 
which proves that Q satisfies with Qi = —F. Since Q is an optimal solution, 
obviously holds. 

Conversely, suppose that Qi is a function in i7°°(Mmi-i,ni-i) satisfying ( ^l3|) . Then 



it follows from Lemma |4.3| that there exists a function G G i/°°(Mmi,ni) such that 

^ O O \ 



( H^, yG[v, Vc ) 



which implies that 



-{G + Qo) 


$12 


$21 


$22 



w* 


( touo O 
O $ff - 


O 

$S^2^ 




I o $51^ 


$g 



V* 



and so g = G + Qo e H°°{Mm,,nJ- Clearly, 
solution. ■ 



implies now that Q is an optimal 

It is easy to see that Theorem [4.2| reduces the problem of finding a superoptimal 
solution for $ to the same problem for the matrix function 
a lower size. 



which has 
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5. The matrix functions Vc and Wc are left invertible in H' 



In the last section we reduced the problem of finding a superoptimal solution for 



$ to the same problem for 



dcf ^11 



$(1) 



^12 



$(1) 
^22 



If we could continue this process, 



we would eventually reduce the problem to the case min{mi,ni} = 1 and it would 
follow from Corollary |3.4| that there is a unique superoptimal solution to the four 
block problem for $. The main problem now is to prove that the four block operator 
has a maximizing vector. This is certainly the case if ||r$J|e < ||r$||e. To 
prove this inequality we use an idea of [PY2] based on the solution of the so-called 
matricial corona problem for the matrix functions Vc and Wc- However in our case the 
solvability of this corona problem is much harder than in [PY2] where Vc, Wc € QC. 

In this section we shall prove that the matrix functions Vc and Wc are left invertible 
in (in other words the corona problem is solvable for them) which we shall use 
in the next section to prove that ||r$J|c < ||r$||e. 

Theorem 5.1.. //||r$||c < ||r$||, then the matrix functions Vc and Wc defined in 
Section 3 are left invertible in H°° . 

Clearly, it is sufficient to prove that Wc is left invertible in which means that 
there exists a matrix function f2 in iy°°(M„^ „j„i) such that VL{QWc{C) = I fo^' every 



G ©. To show the left invertibility of V^ it is sufficient to apply Theorem |0| to 
the transposed function $* and use the equalities ||r$|| = ||r$t|| and ||r$||e = ||r<j,t||e, 
which follow immediately from the obvious identity 

where Jp *== zp for a vector function p in L^. 

Recall that w = Wi © W2 is the first column of W* . Denote by Wi^, 1 < r < ni, 
the components of Wi. We have wi = W(^o)W(i), where W(o) is a scalar outer function 
in and is an inner column function. 

Lemma 5.2.. The vectorial Toeplitz operator Tj^^ : iJ^(C"^) is left invert- 

ible. 

Proof. First of all, KerT^^ = {O}. Indeed, assume that G KerT^j^. Then Wi^ip G 
for 1 < r < ni. Since 1 is a greatest inner divisor of the components of Wi, it 
follows from Beurling's theorem that the functions 

ni 
r=l 

form a dense subset in H^. Therefore we can approximate ip in the L^-norm by 
functions of the form I]r=i ^r'U^i^V^, each of which belongs to Hl^ =^ G : ip{k) = 
for > 0}. Hence ip G H\ and since ip G H^, it follows that = Q. 
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If Tyji is not left invertible, there exists a sequence of scalar functions {^Pj}j>o in 
such that \\(pj\\ = 1 and O in the weak topology and ||T^^(y9„|| — > 0. By 

Lemma 141] the operator T„q is onto and so there exists a sequence {ci;„}n>o of scalar 
functions in (KerTug)-*- such that Tu^oju = ^n- Since T„q is Fredholm, ojn O weakly. 

Put pn =^ e iJ2(C"i) © L2(C"2), where v is the first column of V. Let Qo be an 
optimal solution of the four block problem for $. By ( ^.11 ) we have 



$21 $22 ' ^ 



/ toUoUJj \ 



V 



toUQUJjW = to{(pj + ipj )w, 



for some functions (pj G H^. It follows that 



\T^Pj\ 



11 



21 



12 
22 



\toF {ifj + if- )w\ 



\tQUQUJjW\ 



||toP+(^iWi|| 



<I> e 



since ||t^(C)||c"i = ||ti^(C) llc^i • 

Taking into account that ||T^y9j|| and pj O weakly, we obtain 
^0 = ||r$|| which contradicts the hypotheses of the lemma. ■ 

The next step is to prove that the Toeplitz operator T^^ is left invertible, where 
is the inner part of Wi. We need the following well known facts. Let x = {Xj}i<j<k 
be a column function in iJ°°(C'^). Then it is left invertible in H°° (i.e. there exist 
functions Kj, 1 < j < k, such that Z]j=i s(C)Xi(C) = 1 for C ^ I^)) if and only if 
the Toeplitz operator is left invertible (see [Ar]). Note that by the Carleson corona 
theorem (see e.g., [Ni]) x is left invertible if and only if inf^go HxlOllc^ > 0. This 
result was generalized in [SNF2] for the case of matrix (and even operator) functions: 
let S be a matrix function in H°°, then S is left invertible in H°° if and only if the 
Toeplitz operator T~ is left invertible. 



Lemma 5.3. 

left invertible. 



Under the hypotheses of Theorem ^[j the Toeplitz operator T^^.^ 



IS 



Proof. By Lemma |5.2| , T^^ is left invertible. By Arveson's theorem mentioned above 
wi is left invertible in We have wi = wi^o)W{i), where w^o) is a scalar outer 

function in and W(i) is an inner column function. Obviously, it follows that 
is left invertible in . Again by Arveson's theorem this implies that T^^.j is left 
invertible. ■ 

We need the following result proved in [P]. 
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Theorem 5.4.. Let W be a unitary-valued matrix function of the form W* = 
w Wc ^ , where w is a co-outer inner column, and Wc is a co-outer inner function. 

Then the Toeplitz operator T^t has trivial kernel and dense range, and the operators 

H^tH-wt and iJ(*^t-)*if(w')* are unitarily equivalent. 

The foUowing resuh can easily be deduced from Theorem |5.4| 

Theorem 5.5.. Let W be a matrix function satisfying the hypotheses of Theorem 
\5.4 - Suppose that ||i^w|| < 1- Then the Toeplitz operator T\/\^t is invertible. 



Proof. Clearly, ||/7cw*)*|| = ||-f^w*|| = ll-f^w'll- 1^ is easy to see that ||-ffw*ll = ll-^^wH < 
1. By Theorem |5.4| the operators H^tHsji^t and if*-y^t-),iJ(w*)* are unitarily equivalent. 
Therefore ||ifw*|| = ll-^{wt)*|| < 1- Since W* takes isometric values on T, it is easy 
to see that 

for every vector function F. Consequently, T-w* is left invertible if and only if 
||-?/w*|| < 1- It follows that both Tw* and T(w*)* are left invertible which means 
that Tw* is invertible. ■ 

Proof of Theorem |5.1| . Put w =^ and let W* = W{i) Wc ) • By Lemma 



]3|, T^j-j is left invertible. Since tf («) takes isometric values on T, we have as in the 
proof of Theorem |575 

for every u G H^. Hence < 1 and so by Theorem 5^ the operator Tw* is 

invertible. Clearly, it follows that T^^^ is left invertible, since Tpp^ can be interpreted 
as a restriction of Tw*. Therefore by the Sz.-Nagy-Foias theorem mentioned above 
Wr is left invertible in ■ 



6. The essential norm of r$(i) 



In Section 4 we reduced the proof of Theorem |2ri| to the fact that ||r$|.jj||c < llT^lle, 



where the matrix function 



is defined in ( [4.1|) . In this section 



we are going to use the facts that Vc and Wc are left invertible (see Section 5) to 
prove this inequality which will complete the proof of Theorem 
The idea behind the proof is the following. We use the fact that 



|r${i)||e = inf{limsup ||r$(i)^j||2}, 
j 
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where the infimum is taken over all sequences {^j} in H'^{<C^^^^) ©L^(C"^) such that 
II II 2 = 1 and — > O weakly. Given such a sequence {^j} we construct another 
sequence {pj} in H'^iV^^) © L2(C"2) such that ||pj||2 = 1, Pj ^ O weakly and 

limsup ||r$pj||2 > limsup ||r$(i)^j ||2. 
i i 

To this end we are going to use a construction which is similar to the one used in 
[PY2]. 

Let W be the unitary- valued matrix function constructed in Section 3. Consider 
the matrix W* which has the form 

W* -- 

Let 





F 




o 


G 


( 


F 


\ 


\ o 


G j ■ 



(6.1) 



To use a construction similar to the one given in [PY2] , we have to find a left inverse of 
/5 of a special form. Recall that we have proved in Section 5 that Wc is left invertible 
in H°°. Let Wl' be an left inverse of Wr. 



Lemma 6.1.. Let (3 he the matrix function defined by ( |6.1|) . Then the matrix 
function G is invertible in L°° and there exists a bounded left inverse of (3 of the form 



B 





X 


o 





(6.2) 



Proof. Suppose that G is not invertible in Then there exists a sequence {Cj}j>o 
in L2(C™2) such that ||^j||2 = 1 and WG^jh ^ 0. 
It is easy to see from Lemma K^ that 



w.iOWl-. > "^"'l' „l^'"" =S<i,(e T. (6.3) 



ir 



Since the column Wi{() is orthogonal to the columns of Wc{C) a.e. on T, it follows 
that the matrix function wi Wc ) is invertible in L°°. Therefore there exists a 
bounded sequence {rjj} in L^(C™'^) such that 

[wi Wc) Vj + F^j = O. 

Then 
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It follows from ( |6.3|) that ||w2(C) llc'^a < 1 — 5, C ^ T. Since 11^,^^112 — ^ 0, we have for 
large values of j 



W* 







< Hh < 







which contradicts the fact that W is unitary-valued. 

Let now i? be a matrix in the form (|6.2| ). Clearly B(3 = / if and only if 

Wl'F + XG = 0. 

Since G is invertible in L°°, we can always find a matrix function X in which 
satisfies this equality. ■ 

Remark. In the same way we can consider the submatrix a of the matrix V 
constructed in Section 3, 



a 



Vr 



and prove that a has a left inverse in the form 

A= 



where V^'' is an H°° left inverse of Vc- 

To prove the main result of this section we need the following lemma which in the 
case of Nehari's problem was proved in [PY2] (see Lemma 2.1 there). 



Lemma 6.2.. Lett] be a vector function in //^(C™"^ 
scalar function in defined by 



)L2(C'"2) and let x be the 



Then 



W* 



X 

7] 



mi 



m2 ^ 



Proof. Since W is unitary-valued, we have 

I = W*W = ww' + (3f3* 

and hence 

/? = (3{B(3y = (3(3*B* = {I - Ww')B*. 

Therefore 



W* 



X 
V 



{w /3) 



V 



WX + {I - ww*)B*r] = B*r] + w{x + w B*ri) 



It is easy to see from that B*ri e Hl{C"''^) © L2(C™2). Since w e © 
L°°{C"'^), it follows that w{x + w*B*r]) = wF_w^B*r] e ifi, which proves the result. 
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Remark. It is easy to see that if {rjj} is a sequence of functions in ifi(C™'^^^) © 
L^(C™'^) which converges weakly to O, the above construction produces a sequence 
of scalar functions {Xj} in Xj = —^+{w^B*rij), which also converges weakly to 
O. 

Now we are in a position to prove that ||r$J|c < ||r$||c, where the matrix function 



is defined in (14.11). 



$(1) = 

Theorem 6.3.. Let ^ be a matrix function of the form such that ||r$||e < 

||r$||. Then ||r${i)||c < ||r$||c. 

Proof Let {^j} be a sequence of functions in if!(C"i~^)©L2(C"2) such that ||^j||2 = 1 
and C,j O weakly. Put rjj = T^^m^j. We are going to construct a sequence of 

functions {^f } in H^_{C''^)®L'^{C''^) such that O weakly, and > \\r]j\\2, 

where rjf =^ As we have explained in the beginning of the section, this would 

imply the desired inequality (put pj — — - 



To this end we apply Lemma |6.2| to the sequence {f]j}. We obtain a sequence of 
scalar functions {Xj} such that 

W* ( ) ^ HliC^^) © L^iC^^). 

Put 

where v is the first column of V and A is the left inverse of a described in the Remark 



after Lemma ^TT|. The scalar functions qj will be chosen later. 
We have 

/ tQUQ O \.r^(.#_( toUaQj + toUoV*A*^j \ 

[ o $(1) J ~[ ) ■ ^ ^ ^ 

Since the Toeplitz operator T^^ is onto, we can pick qj as a solution of the equation 

F+(toUoqj + toUoV*A^ij) = Xj- 
Clearly, we may choose the qj so that qj — > O weakly. Indeed, we may put 
= to\Tu,\{KeTTu,)^)-\toXj - f+uov*A'Q. 

It follows that O weakly. 

Let us show that the sequence {^f} has the required properties. Since O 

weakly, to prove that --^ > O weakly, we have to estimate ||^J||2 from below. We 

IISj l|2 
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have 

lief 11^ = \\V*if\\l = H + v*A%\\l + IIOII^ > 1. (6.5) 

To complete the proof it remains to show that ||r$^*||2 > ||^7j||2- 

Recall that rjj = T^mr]j = p-<l>(i)^j and so <t>^^^^j - r]j e H'^{C^^^^) © {O}. It is 
easy to see from the definition of W (see Section 3) that 



It follows now from ( |6.4| ) that 
where 

UJ. 41^ P_(toUogj. + tQUQV*A^Q. 

Since the first column of W* is Wi © W2 and Wi G it follows that 

W* I ^ 
We have chosen {xj} so that 

Therefore 



m2\ 



Hence 

II2 = llx. + ^.Il2 + h.ll2 = tl\\q, + v*A'm + iir^^e,! 

Since ||r$(i)|| < to, this together with ( |6.5| ) yields 

ll^f II2 > h,||2 



lief ih - lie 



which completes the proof. 



As we have already explained, Theorem |6.3| allows us to complete the proof of 



Theorem 2.1 



Proof of Theorem |2.1| . By Theorem [4. 2| the four block for $ has a unique solution 
if so does the four block problem for ^^^^ and the superoptimal singular values of the 
four block problem for are ^1,^2, ■ ■ ■ By Theorem |6.3|, ||r$(i)||e < ||r<i,||e. If 

r^(i) = O, we certainly have uniqeness. Otherwise we can continue this process. 



APPROXIMATION BY ANALYTIC MATRIX FUNCTIONS. THE FOUR BLOCK PROBLEM 23 



Doing in this way we may stop the process if we get on a certain stage the zero four 
block operator or, otherwise, we eventually reduce the problem to the case d = 1. 
Uniqueness follows now from Corollary |3.4 

The fact that the singular values are constant on T follows immediately from the 
facts that V and W are unitary- valued and uo is unimodular, and from Lemma |3^. 



7.. Thematic factorizations and indices of superoptimal singular 

VALUES 



In this section we analyze the algorithm described in Section 3 and obtain certain 
special factorizations of superoptimal symbols of four block operators satisfying the 
hypotheses of Theorem Following [PYl] we shall call such factorizations thematic. 

In Section 3 we have constructed matrix functions V and W associated with the 
four block problem. By analogy with [PYl] we shall call matrix functions of the form 
V or W thematic functions. 

To state the result we may assume without loss of generality that ni < mi (other- 
wise we can take the transpose). 

Theorem 7.1.. Let ^ be a superoptimal symbol of the four block operator T which 
satisfies the hypotheses of Theorem \2. j| and suppose that rii < mi. Then $ admits 
the following factorization 

$ = w^wiw; ■ ■ ■ w2^iDv^^i ■ ■ ■ v;v*v* (7.1) 

where 





( toUo ■ 









tiUi ■ 







D = 


■ 
■ 

■ 


■ td-lUd-l 






V 





the Uj are unimodular functions such that the Toeplitz operator Tu^ is Fredholm and 
indTu- > 0, and the matrix functions Wj and Vj have the form 

where Vj, Wj are thematic matrix functions and Ij is the identity j x j matrix. 
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It is easy to see that the successive apphcation of the algorithm described in Section 
3 gives us a desired factorization. 

Remark. As in the case of Nehari's problem (see [PYl]) it is easy to see that if a 
matrix function admits a factorization of the form ( |7. 1|) , then it is the superoptimal 
symbol of the corresponding four block operator. 

We can associate with the factorization {TA) the factorization indices kj which are 
defined in the case tj ^ 0. We put kj = indT,^^. = dimKerT„^, . 

It was shown in [PYl] that even for Nehari's problem the indices depend on the 
choice of a thematic factorization rather than on the function $ itself. However, it 
was shown in [PY2] that for Nehari's problem with compact Hankel operator the sum 
of the indices corresponding to equal superoptimal singular values is an invariant (i.e. 
does not depend on the choice of a factorization). 

The same turns out to be true for the four block problem too, and we shall prove 
this later in Section 9. Moreover, the sum of the indices corresponding to equal 
superoptimal singular values admits a quite natural and simple geometric interpreta- 
tion. To give this interpretation we have to introduce a new object — the so-called 
superoptimal weight. 
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8. SUPEROPTIMAL WEIGHT 



Let W G L°°{Mn,n) be a matrix weight, i.e. a bounded matrix- valued function on 
T, whose values are nonnegative selfadjoint n x n matrices. 

Given a four block operator r : H^{C'^^)®L'^{C''^) H'^{C"'')Q)L'^{C"'^), ni+ns = 
n, we call a weight W admissible if 

l|r/f < (w/, /) I (W(c)/(C), /(C))rfm(c), / G H\c^^) © l2(c'^^). 

We need the following result which we call Generalized Nehari's Theorem. 

Theorem 8.1.. Given a four block operator r$ and an admissible weight W there 
exists a symbol $ ofT (i.e. an operator-valued function $ such that T = Tip) satisfying 
$*$ < W. 

If $ is a symbol of F satisfying $*$ < W, we say that $ is dominated by the 
admissible weight W. 

In the case W = c/, v G M+, this result was established in [FT], and this is 
an analog of Nehari's theorem for four block operators. In the general case the 
result follows from Theorem 1.1 of [TV], since the four block operator F acting from 
the space H'^{C"'^) © L^(C"^) endowed with the weighted norm || ■ Hw to the space 
H'^iC'^') © L2(C""2) satisfies the hypothesis of the theorem. 

For the sake of completeness we deduce Theorem p.l| from the analog of Nehari's 
theorem mentioned above. 

Proof of theorem |8.1j . Define We = W + el. Since We > el, it admits a 
factorization W^ = G*Gi;, where G^ G H°°{Mnn) is a matrix function which is 
invertible in H°° (see [R]). The weight W^ is clearly admissible, so 

||r/|| < \\GJ\\, f eH\c^-)®L\c-^), 

which is equivalent to the fact that 

||rG;Vll< 11/11, f e H\c^^) ® L\c--) , 

Since G^^ G H°°{Mn^n), we can consider the operator TGj^ as a four block operator. 
By the analog of Nehari's theorem it has a symbol such that ||^e||oo ^ 1- Then 
the function = \l/(je is a symbol of F and 

$*$e = G*'^*^eGe < G*Ge = We = W + el. 

It remains to chose a sequence {ej} converging to and such that the sequence {^e^} 
converges to a matrix function, say $ G L°°, in the *-weak topology. Clearly, $ is a 
symbol of F dominated by W. ■ 
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Definition. Let W be an admissible weiglit for tlie four block operator G. Con- 
sider the numbers 

s°°(W) = esssupsj(W(C)), 0<j<rf-l, ci = min{mi, m}. 

The admissible weight W is called superoptimal if it lexicographically minimizes the 
numbers sg°(W), sf'(W), ■ ■ ■ , s^i(W) among all admissible weights, i.e., 

s~(W) = min{s;^(V) : V is admissible}, 

s^iyV) = min{sf^(V) : V is admissible, s'^iV) is minimal possible}, etc. 
The following lemma shows that under the the hypotheses of Theorem |2.1| a super- 




optimal weight exists. However, a superoptimal weight is not unique in general. The 
lemma also shows that a superoptimal weight is nevertheless "essentially" unique for 
our purposes. 

Let Aa, a G M, be the function on M defined by 

t > a 

t < a 

Lemma 8.2.. Let T be a four block operator satisfying the hypothesis of Theorem 
Let $ be the superoptimal symbol ofT. Then 

1. $*$ is a superoptimal weight for T ; 

2. // W and W are two superoptimal weights, then Aa(W) = Aa(W") for any 
a > td-i- 

Proof. Note that $ is a symbol of F dominated by the weight Suppose that 

$*$ is not a superoptimal weight, i.e. that there exists an admissible weight W such 
that for some jo, < jo < d — 1, 

s-(>V) < s=^(<l>*$), s-(>V) = ($*$), < J < Jo- 

Let be a symbol of F dominated by the weight W. Then 

sf^i^) < sf^i^), .-(VI/) = < J < Jo, 

which contradicts the fact that $ is the superoptimal symbol of F. Therefore $*$ is 
a superoptimal weight. 

Let now W be a superoptimal weight, and let be a symbol of F dominated 
by W. Then \E' lexicographically minimizes (s[^(\E'), sf^(\E'), ...sJ^^(\E')) and so 
coincides with the superoptimal symbol $. So, for any superoptimal weight W the 
superoptimal symbol $ is the unique symbol of F dominated by W. This means that 
$*$ < VV for any superoptimal weight W. Together with the equalities s°°(W) = 
59°($*$) = t'j this implies the second part of the lemma. ■ 
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Denote by the function defined by 

, f I t, t > a 

Aa{t)'^\ - (8.1) 

a, t < a 



The following fact is an easy consequence of Lemma 

Corollary 8.3.. Let W and W be two superoptimal weights. Then Aa(>V) = 
Aa(W')/or anya>td-i. 

It is easy to see that if a = td-i and W is a superoptimal weight, the weight Aa(>V) 
is the (unique) maximal superoptimal weight. 



9. Invariance of indices 



The main result of this section shows that the sum of the indices of a thematic fac- 
torization of a superoptimal symbol does not depend on the choice of a factorization. 
To prove this fact we shall use the same construction which was used in Section 6 to 
prove Theorem |6.3| . 

Let 

> ai > ■ ■ ■ > ai 

be all the distinct nonzero superoptimal singular values of a four block operator F 
which satisfies the hypotheses of Theorem |2.1| . Let $ be the superoptimal symbol of 
r and let kj be the indices of a thematic factorization of $ of the form ( [7. 1| ) . Consider 
the sum of the indices that correspond to equal superoptimal singular values: 

kj, 0<r < I. 

{j:tj=ar} 

The following theorem is the main result of the section. 

Theorem 9.1.. The numbers Vj. do not depend on the choice of thematic factor- 
ization of $ . 



We are going to deduce Theorem ^TT] from Theorem ^ below, which describes the 
numbers u^. in terms of a superoptimal weight W. 

We say that a nonzero function ^ G if^(C"^) © L^(C"2) is a maximizing vector for 
an admissible weight W if 

Lemma 9.2.. Let T be a four block operator, W an admissible weight for T , and 
$ a symbol of V dominated by W . Let ^ be a maximizing vector for W. Then 

G i/i(c™i) © l2(c"2), I.e., = $6 
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Note that for W = cJ, c G IR+, this was proved in Lemma ^ 
Proof. We have 

mA) = WT^ai = w^-mi < ii*6ir < m,o ■ 

It follows that ||P"$^||2 < II $^2 IP, which implies the result. ■ 
Given an admissible weight W put 

g(W) = {^e i7^(C"i) © L=^(C"2) : ^ is maximizing for >V or ^ = O}. 

It is easy to see that ^ G S{W) if and only if P+W^ - rjr$^ = O, where P+ is the 
orthogonal projection onto H'^{C"'^) © L^(C"2). Therefore S{yV) is a closed linear 
subspace. Recall that Aa(W) does not depend on the choice of superoptimal weight, 
where Aq is defined in ( |8.1|) . 

Theorem 9.3.. Let T be a four block operator satisfying the hypotheses of Theo- 
rem W a superoptimal admissible weight, and $ the superoptimal symbol ofV. 



Consider a thematic factorization of $ of the form (|7.1| ) . Let kj be the indices of the 
factorization. Then for a > ai, 

J2 k, = dimS{Aam). (9.1) 

{j:tj>a} 



Let us first deduce Theorem 9.1 from Theorem W. 



Proof of theorem |9.1J . It follows immediately from (|9.1|) that 

z/o = dimS{Aa,{W)), = dim£(A,^(W)) © ^(Aa,_,(>V)), 1 < j < /, 
which proves the result. ■ 



Proof of Theorem |9.3| . It is easy to see that S{Aa{'W)) is constant on (oj+i 



So it is sufficient to prove that for < s < / 

J2 kj = dim S{Ws 

{j:tj>as} 

where Ws = Aa^(W). 

Let us prove the theorem by induction on d. 
If d = 1, factorization ( [7.1D has the form 

where 



W*DV* , 



( toUo 



D 



\ 



and II II < to (otherwise the essential norm of r$ would not be less than to)- Clearly, 
Wo(C) = c^o-^- If ^ is a maximizing vector for Wo, then it is easy to see that only the 
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first entry of V^*^ is nonzero. Tlierefore ^{Q is pointwise ortliogonal to all columns 
of V except for the first one. It follows that ^ = x^? where x is a scalar function in 
L^. Using the fact that vi is a co-outer column function, one can easily deduce that 
X G H'^. It is easy to see that 

= toUoXw. 

Since ^s a maximizing vector, it follows that $^ G i7i(C™^) © ^^(C™^). We can 
now use the fact that Wi is a co-outer column function to deduce that UqX G 
which means that x ^ KerT^y. 

Conversely, it is easy to see that if x ^ KerT^g, then ^ = xw is a maximizing 
vector, which proves that dim£(>Vo) = dimKerT^^ = /cq. 

Suppose now that the theorem is proved for d — 1. We have 

O J ' 

where is the superoptimal symbol of r^(i) , W =^ Wq, and V =^ Vq. The induction 
hypothesis implies that the theorem holds for r$(i). 

Let < s < Z and a = as- Suppose that W is a superoptimal weight for r^(i). By 
the induction hypothesis 

{j>l:tj>a} 

where the kj are the indices of the thematic factorization (|7. 1| ) . By Lemma |9.2| , 
C e E{Aa{W)) if and only if 

G if!(C™^-') ©L2(C'"^) and = (Aa(W)e,0- 

Let ^1, ^2, ■ ■ ■ ; be a basis in S{Aa{W)) and let rji, = r$(i)^t. By Lemma |6]^ there 
exist scalar functions Xm ^ ^ ^ Af, such that 



As in the proof of Theorem |6.3| we define the functions as 
where is a scalar functions in satisfying 



(recall that the matrix function A is defined in after Lemma 3J.). We have 



where as in the proof of Theorem |67 

uj, = ¥.{tQUQq, + UuoV*A*Q. 
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As we have explained in the proof of Theorem |0 



w* 



and so rjf = 

Since the matrix function W is unitary-valued, we have 



mt 



#l|2 



w* 



toUo O 



= tl\\uo{q, + v*A%)f + \\<!>^'^^' 

= tl\\q, + v*A%f + {A,{W%,^,) 

(the last equality holds because G £{Aa(W)), where W = (<|)(i))*$(i) is a super- 
optimal weight for r^(i)). 
Consider the weight V, 

v=f ^0 OA 
^ [o w )■ 

Bearing in mind that 

we can continue the above chain of inequalities: 

(the last equality holds because V is unitary- valued). Since r$^* = F~^^f = ^C,f, 
it follows from (U) that G S{Aa{W)). 

We can add now another ko linear independent vectors of £{Aa{yV)). Let Xi, - ■ ■ , 
be a basis of KerT„g. Obviously, x^v G £{Aa{yV)). Let us show that the vectors 
ii 1 ' ' ' liJii ^i"^' ■ ■ ■ i^koV are linearly independent. It is sufficient to prove that if 



X G Ker Tu^^ and xv + Y.i=i CtC* = O, then x 



N 



V\xv + Y,c,it) 



X 



L = l 



N 

t=i 



and c, = I < L < N . We have 

/ v*A'i, + qA _^ 



(9.3) 



Since the are linearly independent, it follows that Ci^ = 0, 1 < l < N, which in turn 
implies that x = O. 
This proves that 

^ k,<dimS{Aa{W)). 

{j:tj>a} 

Let us prove the opposite inequality. 

Denote by Sq the set of vectors in S{Aa{yV)) of the form xv such that a; is a scalar 
function in H^. It is easy to see that xv G So if and only if x G KerT^g. It remains to 
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show that there exists at most Y.ij>o:t >a} vectors that are hnearly independent 



def 



modulo £q. Let f]^ = F^^i. By Lemma p^ , fj^ = Put 

where 7^, are scalar functions in L^. Since the vectors are linearly independent 
modulo £0, the vectors are linearly independent. To complete the proof, it is 
sufficient to show that G S{Aa{W)). 

Since r/^ = we have that 77^ = ^'•^^^t and 5^ = to'^oTt- It follows from the block 
structure of and that ^ e i/2(C"i-i) ©l2(C"2) and tj, G i7!(C™i-i) ©L2(C'"2). 

So r], = $(1)^, = r^d)^.. 

To show that G S{Aa{yV')), consider the following chain of equalities 

{AamLi) = {Aa{vwv*)vLvi) 

= ((o Aa(VV') )(?;)'( 

On the other hand 

{AamLi) = \\<^i\\ = \m = + = n^^^^e.f + ti^f. 

Therefore (Aa(W)e.,6) = ll'^'^^^^JP , which implies G Aa(W'). ■ 



IQ. Singular values of r$ and superoptimal singular values 



Let r be a four block operator satisfying the hypotheses of Theorem Denote 
by $ its unique superoptimal symbol and consider a thematic factorization of $ of 
the form (|7.1| ). Let {tj} be the superoptimal singular values and {kj} the indices of 
the factorization. Consider the extended t-sequence for F: 

in which tj is repeated kj times. We denote the terms of the extended sequence by 
t'Q,t[,t2, ■ ■ ■ . Although the indices kj depend on the choice of thematic factorization, 



it follows from Theorem |9.1| that the extended t-sequence is uniquely determined by 
F. 

In [PY2] it was shown in the case of Nehari's problem with $ G H°° + C that 
t'j < Sj{Hij,), < j < k — 1. In this section we are going to prove the same inequality 



in the case of the four block problem under the hypotheses of Theorem 2.1. Moreover 
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we prove in this section a stronger result which is also new in the case of Nehari's 
problem with an if °° + C symbol. To prove the results we use in this section the 
same machinery as we used in Section 6. 

Let r be a four block operator that satisfies the hypotheses of Theorem |2.1| . Then 
(see Section 3) 

where the unitary-valued matrix functions V and W are defined in Section 3. The 
following inequality is the main result of the section. 

Theorem 10.1.. Let T he a four block operator such that ||r$||o < ||r$|| and let 
$ he its superoptimal symbol. Then 

Sj(r^(i)) < Sj+koiX^), j e z+. 

Recall that = dim Ker T^g . 

Let us first derive from Theorem p.0.1| the desired inequality between the singular 
values of r$ and the superoptimal singular values. 



Theorem 10.2.. Under the hypotheses of Theorem^ 

t- < s,(r$), J > 0. 



Proof of Theorem |10.2| . Let x G KerT^g. Clearly, 

T^xv = W* [touox O ■■• O)' = toUoXweHl{C"'')(SL\C"'^). 
It follows that ||r$xf II2 = to||a;f II2 which proves that 

s,(r$)=to, 0<j< A;o-l. (10.1) 
We can now proceed by induction on d. Clearly, the result holds for d = 1. It is 



also obvious that if the theorem holds for r$(i) , then by Theorem |10.1 

t'^<s,{T^), ko<j<d-l, 

which together with ( |10.1|) proves the theorem. ■ 

Proof of Theorem |10.1| . Clearly, it is sufficient to prove the following fact. Let 

£beasubspace of if^ (C"i-1)©l2(C"2) such that ||r$(i)^||2 > s||^||2, for every ^ e £, 
where < s < to, then there exists a subspace M of i7^(C"i) © L^(C"2) such that 
dim > dim C + ko and ||r$p||2 > s||p||2 for every p & M.. 

Let ^t, 1 < 6 < A^, be a basis in C. Put rj^ = r$(i)^t. By Lemma |6]^ there exist 

scalar functions in such that W* (^^^ ^ e fT^ (c™i) © ^^(C™^). We define the 
functions ^* E H^_{0'') © ^^(C"^) by 
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where is a scalar function in satisfying 
(see the proof of Theorem |ti.3D . 



We can now define M. by 

M = span{^f + XV : 1 < l < N, x e KerT„„}. 

Let us show that dimM. = N + k^. Since dimKerT^g = /cq, it is sufficient to 
prove that if xv + Y^^=i Ciit — then x = Q and q = 0, 1 < i < A^. This follows 
immediately from ( p.3| ). 

To complete the proof it remains to show that ||r$p||2 > s||p||2 for p = xv + 

e:=i c,ef . Let e = e:=i c.e., v = ^ik, g = e:=i e.g., and ef = c.ef. 



We have 



W* 



W* 



toUo O W x + q + v*A^^ 



It follows (see the proof of Theorem |6.3| ) that 

toUox + toUoq + toUov*A''^ 



r$p = W 

Therefore 



r$(i),^ y 
l|r$p||^ = |tonk + g + t;M*^||2 + ||,;||2. 

We have 

\\p\\l = \\V*p\\l = \\x + q + v*A'al + U\\l 
Since s < to and ||?7||2 > s||^||2, it follows that ||r$p||2 > s^||p||2. ■ 

Theorem 10.1 certainly applies to the case of Nehari's problem. Recall that a 
matrix function $ is called very badly approximable (see [PY]) if the zero function is 
a superoptimal approximant of $. 

Recall that under the condition ||-f^$||c < \\H<i,\\ the function $ admits a factoriza- 
tion 

where V and W are unitary matrix functions of the form 

V = (^v Vc) , W = [w W 



5 
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and Mo is a unimodular function such that =^ dimKerT^g > 0. 
The following result is certainly a partial case of Theorem |10.1| . 

Theorem 10.3.. Let ^ be a very badly approximable matrix function on T such 
that ||-f^#||e is less that the smallest nonzero superoptimal singular value of Nehari's 
problem. Then 

where ^^^^ and ko are given by the factorization (|10.2| ) . 
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